Application Recovery in Parallel Programming Environment
Abstract
This paper presents the fault-tolerance feature of TOPAS, a parallel programming environment for distributed systems. TOPAS automatically analyzes data dependences among tasks and synchronizes data, which reduces the time needed to develop parallel programs. TOPAS also provides support for scheduling, load balancing, and fault tolerance. The main topic of this paper is a solution for transparent recovery of asynchronous distributed computations on clusters of workstations, without hardware spares, when a fault occurs on a node. Experiments show the simplicity and efficiency of parallel programming in the TOPAS environment with integrated fault tolerance, which provides graceful performance degradation and short reconfiguration time for application recovery.
Similar resources
Integrating Fault-Tolerant Feature into TOPAS Parallel Programming Environment for Distributed Systems
In this paper, TOPAS, a new parallel programming environment for distributed systems, is presented. TOPAS automatically analyzes data dependences among tasks and synchronizes data, which reduces the time needed to develop parallel programs. TOPAS also provides support for scheduling, dynamic load balancing, and fault tolerance. Experiments show simplicity and efficiency of parallel programmi...
Process Dependencies and Process Interference Rules for Analyzing the Impact of Failure in a Service Composition Environment
This paper presents a process dependency model for dynamically analyzing data dependencies among concurrently executing processes in an autonomous, distributed service composition environment. Data dependencies are derived from incremental data changes captured at each service execution site. Deltas are then used within a rule-based recovery model to specify how failure recovery of one process ...
Implementation of the direction of arrival estimation algorithms by means of GPU-parallel processing in the Kuda environment (Research Article)
Direction-of-arrival (DOA) estimation of audio signals is critical in several areas, including electronic warfare and sonar. Beamforming methods such as Minimum Variance Distortionless Response (MVDR) and Delay-and-Sum (DAS), together with the subspace-based Multiple Signal Classification (MUSIC), are the best-known DOA estimation techniques. These methods have high computational complexity. Hence using...
Fault Tolerance Lessons Applied to Parallel Computing
This paper describes an approach to fault-tolerant parallel computing which is based on the experiences with the most successful fault-tolerant software – the transaction processing systems. The algorithms presented here have less runtime overhead and faster recovery than most preceding approaches. In the Pact parallel programming environment fault tolerance is provided fully user transparent i...
A Novel Replication Technique For Detecting and Masking Failures For Parallel Software : Active Parallel Replication
We present a novel replication technique for parallel applications where instances of the replicated application are active on different groups of processors called replicas. The replication technique is based on the FTAG (Fault Tolerant Attribute Grammar) computation model. FTAG is a functional and attribute based model [4]. The developed replication technique implements "active parallel replica...
Publication date: 2002